Halting viruses in scale-free networks 
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The vanishing epidemic threshold for viruses spreading on scale-free networks indicate that tradi- 
tional methods, aiming to decrease a virus' spreading rate cannot succeed in eradicating an epidemic. 
We demonstrate that policies that discriminate between the nodes, curing mostly the highly con- 
nected nodes, can restore a finite epidemic threshold and potentially eradicate a virus. We find that 
the more biased a policy is towards the hubs, the more chance it has to bring the epidemic threshold 
above the virus' spreading rate. Furthermore, such biased policies are more cost effective, requiring 
less cures to eradicate the virus. 



While most diffusion processes of practical interest, 
ranging from the spread of computer viruses to the diffu- 
sion of sexually transmitted diseases, take place on com- 
plex networks, the bulk of diffusion studies have focused 
on model systems, such as regular lattices or random 
networks 0,0]. A series of recent results indicate, how- 
ever, that real networks significantly deviate from the 
structure of these model systems M - deviations that 
have a strong impact on the diffusion dynamics as well. 
In particular, the networks responsible for the spread of 
computer viruses, such as the Internet Ml or the email 
network ||, have a scale-free topology |J, exhibiting 
a power-law degree distribution P{k) ~ kr 1 where 7 
ranges between 2 and 3. Similarly, a recent study indi- 
cate that the social network responsible for the spread of 
sexually transmitted diseases, such as AIDS, also exhibits 
a scale- free structure . The topology of scale-free net- 
works fundamentally deviate from the topology of both 
regular lattices and random networks [pi , differences that 
impact the network's robustness and attack tolerance ||] 
or the dynamics of synchronization |l0|. It is not un- 
expected, therefore, that the broad degree distribution 
leads to unexpected diffusion properties as well pi[ . 

A simple model often used to study the generic features 
of virus spreading is the susceptible-infected-susceptible 
(SIS) model. In this model an individual is represented 
by a node, which can be either "healthy" or "infected". 
Connections between individuals along which the infec- 
tion can spread are represented by links. In each time 
step a healthy node is infected with probability v if it is 
connected to at least one infected node. At the same time 
an infected node is cured with probability 5, defining an 
effective spreading rate A = K for the virus. 

The behavior of the SIS model is well understood if the 
nodes are placed on a regular lattice or a random network 
|ij . Diffusion studies indicate that viruses whose spread- 
ing rate exceeds a critical threshold A c will persist, while 
those under the threshold will die out shortly. Recently, 
however, Pastor-Satorras and Vespignani have shown |Tl]] 
that for scale-free networks with 7 < 3 the epidemic 
threshold vanishes, i.e. A c = 0. This finding implies 
that on such networks even weakly infectious viruses can 
spread and prevail. This vanishing threshold is a conse- 



quence of the hubs - nodes with a large number of links 
encoded by the tail of power law P(k). Indeed, the hubs 
are in contact with a large number of nodes, and are 
therefore easily infected. Once infected, they pass on the 
virus to a significant fraction of the nodes in the system. 

The finding that the epidemic threshold vanishes in 
scale-free networks has a strong impact on our ability 
to control various virus outbreaks. Indeed, most meth- 
ods designed to eradicate viruses - biological or com- 
puter based - aim at reducing the spreading rate of the 
virus, hoping that if A falls under the critical threshold A c , 
the virus will die out naturally. With a zero threshold, 
while a reduced spreading rate will decrease the virus' 
prevalence, there is little guarantee that it will eradi- 
cate it. Therefore, from a theoretical perspective viruses 
spreading on a scale-free network appear unstoppable. 
The question is, can we take advantage of the increased 
knowledge accumulated in the past few years about net- 
work topology to understand the conditions in which one 
can successfully eradicate viruses? 

Here we study the spreading of a virus to which there 
is a cure, eradicating the virus from the node to which 
it is applied to, but which does not offer a permanent 
protection against the virus. If such cure is available to 
all nodes, treating simultaneously all infected nodes will 
inevitably wipe the virus out. However, due to economic 
or policy considerations the number of available cures is 
often limited. This applies to AIDS, for which relatively 
effective but prohibitively expensive cures are available, 
unable to reach the most affected segments of population 
due to economic considerations Jlifl . But it also applies 
to computer viruses, where only a small fraction of users 
commit the time and investment to update regularly their 
virus protection system. We show that distributing the 
cures randomly in a scale-free network is inneffective, be- 
ing unable to alter the fundamental properties of the 
threshold-free diffusion process. However, even weakly 
biased strategies, that discriminating between the nodes, 
curing with a higher probability the hubs than the less 
connected nodes, can restore the epidemic threshold. We 
find that such hub-biased policies are more cost-effective 
as well, requiring fewer cures than those distributing the 
cures indiscriminately. 



Curing the hubs: The vanishing epidemic threshold of 
a virus spreading in a scale-free network is rooted in the 
infinite variance of the degree distribution Jll[ ] . Indeed, 
the threshold A c depends on the variance as 



A f: 
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On a regular lattice the degree distribution is a delta 
function, while on a random network it follows a Poisson 
distribution, in both cases resulting in a finite < k 2 >, 
and therefore nonzero A c . In contrast, if the virus spreads 
on a scale-free network, for which -P(fc) follows a power 
law with 7 < 3, the variance is infinite and the epidemic 
threshold is A c = 0. Therefore, to restore a finite epi- 
demic threshold, which would allow the infection to die 
out, one needs to induce a finite variance. As the origin 
of the infinite variance is in the tail of the degree distri- 
bution, dominated by the hubs, one expects that curing 
all hubs with degree larger than a given degree fco would 
restore a finite variance and therefore a nonzero epidemic 
threshold. Indeed, if on a scale-free network nodes with 
degree k > feg are always healthy, the epidemic threshold 
is finite and has the value Jl3|] 
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Short of a detailed network map, no method aiming to 
identify and cure the hubs is expected to succeed at its 
goal of finding all hubs with degree larger than a given fc . 
Yet, policies designed to eradicate viruses could attempt 
to identify and cure as many hubs as possible. Such bi- 
ased policy will inevitably be inherently imperfect, as it 
might miss some hubs, and falsely identify some smaller 
nodes as hubs. The question is, however, would a pol- 
icy biased towards curing the hubs, without a guarantee 
that it can identify all of them, succeed at restoring the 
epidemic threshold? 

To investigate the effect of incomplete information 
about the hubs we assume that the likelihood of identi- 
fying and administering a cure to an infected node with 
k links in a given time frame depends on the node's de- 
gree as k a , where a characterizes the policy's ability to 
identify hubs. In this framework a = corresponds to 
random cure distribution, which is expected to have zero 
epidemic threshold while a — oo corresponds to an op- 
timal policy that treats all hubs with degree larger than 
fco- Within the framework of the SIS model we assume 
that each node is infected with probability v, but each in- 
fected node is cured with probability 8 = 5$k a , becoming 
again susceptible to the disease. We define the spreading 



rate as A 



As each healthy node is susceptible again 



to the disease, a node can get multiple cures during a 
simulation. 



FIG. 1. The epidemic threshold as a function of fco- 



This expression indicates that the more hubs we cure 
(i.e. the smaller fco is), the larger the value of the epi- 
demic threshold (Fig. 1) ]14| . Therefore, the most ef- 
fective policy against an epidemics would be to cure as 
many hubs as economically viable. The problem is that 
in most systems of interest we do not have detailed net- 
work maps, thus we cannot effectively identify the hubs. 
Indeed, we do not know the number of sexual partners for 
each individual in the society, thus we cannot identify the 
social hubs that should be cured if infected. Similarly, on 
the email network we do not know which email accounts 
serve as hubs, as these are the ones that, for the benefit of 
all email users, should always carry the latest anti-virus 
software. 
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FIG. 2. Prevalence, p, measured as the fraction of in- 
fected nodes in function of the effective spreading rate A 
for a = 0(O), 0.25(D), 0.50(V), 0.75(0) and 1(A), as pre- 
dicted by Monte-Carlo simulations using the SIS model on a 
scale-free network with N=10,000 nodes. 



We place the nodes on a scale-free network |l5| and 
initially infect half of them. After a transient regime the 
system reaches a steady state, characterized by a con- 
stant average density of infected nodes, p, which depends 
on both the spreading rate A and a (Fig. 2). The a = 
limit corresponds to random immunization in which case 
the epidemic threshold is zero. As treating only the hubs 
will restore the nonzero epidemic threshold, for a = oo 
we expect a nonzero A c . Yet, the numerical simulations 
indicate that we have a finite A c well before the a = oo 
limit. Indeed, as Fig. 2 shows, A c is clearly finite for 



a = 1 and so is for smaller value of a as well. The nu- 
merical simulations do not give an unambiguous answer 
the crucial question: Is there a critical value of a at which 
a finite A c appears, or for any any nonzero a we have a 
finite A c ? 

Mean-field theory: To interpret the results of the nu- 
merical simulations we studied the effect of a biased pol- 
icy using the mean-field continuum approach fll|,|llf . De- 
noting by pkif) the density of infected nodes with con- 
nectivity k, the time evolution of Pk(t) can be written as 

EU 



dtpkit) = -S k a p k (t) + i/(l - Pk (t))k6(\). 



(3) 



The first term in the r.h.s. describes the probability that 
an infected node is cured, and it is therefore proportional 
to the number of infected nodes p k (t) and the probability 
5ak a that a node with k links will be selected for a cure. 
The second term is the probability that a healthy node 
with k links is infected, proportional to the infection rate 
(v), the number of links (fc), the number of healthy nodes 
with k links (1 — Pk(t)), and the probability 9(X) that a 
given link points to an infected node. The probability 
0(A) is proportional to fcP(fe), therefore it can be written 
as 



0(A) = £ 



kP(k) 
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(4) 



Using X — j- and imposing the dtPk(t) = stationary 
condition we find the stationary density as 



Pk = 



A0(A) 



A0(A) 



(5) 



Combining Eq.(4) and Eq.(5) and using the fact that the 
connectivity distribution P(k) = 2m 2 /k~ 3 for the scale- 
free network B , we obtain: 



(A 



dk 



= 1. 



fc 2 (fc Q - 1 + A6»(A)) 
The average density of infected nodes is given by 

dk 



(6) 
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p(X) =£p(fc)p(jfc) = 2m 2 X6(X) / 

i. J m 



k 3 (k c 
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Equations (6) and (7) allow us to calculate the average 
density of infected nodes for any value of a. For a = 
they reduce to the case studied in Ref. [|llj giving A c = 0. 
For a = 1 we can solve (6), and using (7) we obtain 



p{X)\ a=1 = 



A-l 



(8) 



which indicates that for a = 1 the epidemic threshold is 
finite, having the value X c (a = 1) = 1 MM. 




FIG. 3. The dependence of the epidemic threshold A c on a 
as predicted by our calculations (continuous line) based on the 
continuum approach, and by the numerical simulations based 
on the SIS model (boxes). The small deviation between the 
numerical results and the analytical prediction is due to the 
uncertainty in determining the precise value of the threshold 
in Monte-Carlo simulations. 



To determine the epidemic threshold as a function of 
a we need to solve the p(X) = equation. While we 
cannot get p(X) for arbitrary values of a, we can solve 
Eq.(6) in A using that at the threshold A = A c we have 
6(X C ) = 0. In this case (6) predicts that the epidemic 
threshold depends on a as 



A r 



am 



(9) 



For a = 0we recover A c = 0, confirming that random im- 
munization cannot eradicate an infectious disease. For 
a = 1 Eq.(9) predicts that the epidemic threshold is 
A c = 1, in agreement with (8). Most important, however, 
Eq.(9) indicates that A c is nonzero for any positive a, i.e., 
any policy that is biased towards curing the hubs can re- 
store a finite epidemic threshold. Furthermore, policies 
with larger a are expected to be more likely to lead to 
the eradication of the virus, as they result in larger A c 
values. Therefore, Eq.(9) indicates that a potential av- 
enue to eradicating a virus is to increase the effective- 
ness of identifying and curing the hubs. Indeed, if the 
virus has a fixed spreading rate, increasing a could in- 
crease A c beyond A, thus making possible for the virus 
to die out naturally. To test the validity of prediction 
(9) we determined numerically the X(a) curve from the 
simulations shown in Fig. 2. As Figure 3 shows, we find 
excellent agreement between the simulations and the an- 
alytical prediction (9). 

Cost-effectiveness: A major criteria for any policy de- 
signed to combat an epidemic is its cost-effectiveness. 
Supplying cures to all nodes infected by a virus is of- 
ten prohibitively expensive. Therefore, policies that ob- 
tain the largest effect with the smallest number of ad- 
ministered cures are more desirable. To addres the cost- 
effectiveness of a policy targeting the hubs we calculated 
the number of cures administered in a time step per 
node for different values of a. Figure 4 indicates that 



increasing the policy's bias towards the hubs by allow- 
ing a higher value for a decreases rapidly the number 
of necessarily cures. Therefore, policies that distribute 
the cures mainly to the nodes with more links are more 
cost effective than those that spread the cures randomly, 
blind to the node's connectivity. We can understand the 
origin of the rapid decay in c(a) by noticing that the 
number of cures administered per unit time is propor- 
tional to the density of infected nodes. From Fig. 2 we 
see that for a given value of the spreading rate the preva- 
lence is decreasing as a increases, decreasing the number 
of necessarily cures as well. 
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FIG. 4. The number of cures, c, administered in an unit 
time per node for different values of a. The rapidly decaying 
c indicates that more successful is a policy in selecting and 
curing hubs (larger is a), fewer cures are required for a fixed 
spreading rate (A = 0.75). For a — the number of cures is 
calculated by c = vj{y + S) = A/(l + A) which gives c = 0.43, 
which value is in good agreement with the numerical results. 



In summary, our numerical and analytical results in- 
dicate that targeting the more connected infected nodes 
can restore the epidemic threshold, therefore making pos- 
sible the eradication of a virus. Most important, however, 
is the finding that even moderately successful policies 
with small a can lead to a nonzero epidemic threshold. 
As the magnitude of A c rapidly decreases with a, the 
more effective a policy is at identifying and curing the 
hubs of a scale-free network, the higher are its chances 
of eradicating the virus. Finally, the simulations shows 
that a biased treatment policy is not only more efficient 
but it is also less expensive than random immunization. 
These results, beyond improving our understanding of 
the basic mechanisms of virus spreading, could also offer 
important input into designing effective policies to erad- 
icate computer or biological infections. 
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